Efficient Value Iteration Using Partitioned Models
نویسندگان
چکیده
In order to solve large-scale value iteration problems, more intelligent allocation of computing time is needed. We introduce the idea of an information frontier, which allows us to identify maximally productive regions of the problem space. We present a potential information flow metric which allows us to quantify the frontier precisely. We also introduce a partitioning scheme, which effectively combines with the flow metric to reduce the complexity of problematic operations. The framework is powerful, and can be used to parallelize valueiteration, effectively manage memory in large-scale problems, or further multi-agent cooperative solution methodologies. A complete algorithm is developed and successfully tested on several problems. Experimental evidence is presented which demonstrates the efficacy of the approach.
منابع مشابه
TR - 2009 - 024 Analysis and Optimization of Robin - Robin partitioned procedures in fluid - structure interaction problems
In the solution of Fluid-Structure Interaction problems, partitioned procedures are modular algorithms that involve separate fluid and structure solvers, that interact, in an iterative framework, through the exchange of suitable transmission conditions at the FS interface. In this work we study, using Fourier analysis, the convergence of partitioned algorithms based on Robin transmission condit...
متن کاملAnalysis and Optimization of Robin - Robin Partitioned
In the solution of Fluid-Structure Interaction problems, partitioned procedures are modular algorithms that involve separate fluid and structure solvers, that interact, in an iterative framework, through the exchange of suitable transmission conditions at the FS interface. In this work we study, using Fourier analysis, the convergence of partitioned algorithms based on Robin transmission condit...
متن کاملAnalysis and Optimization of Robin-Robin Partitioned Procedures in Fluid-Structure Interaction Problems
In the solution of Fluid-Structure Interaction problems, partitioned procedures are modular algorithms that involve separate fluid and structure solvers, that interact, in an iterative framework, through the exchange of suitable transmission conditions at the FS interface. In this work we study, using Fourier analysis, the convergence of partitioned algorithms based on Robin transmission condit...
متن کاملMonte carlo bayesian hierarchical reinforcement learning
In this paper, we propose to use hierarchical action decomposition to make Bayesian model-based reinforcement learning more efficient and feasible in practice. We formulate Bayesian hierarchical reinforcement learning as a partially observable semi-Markov decision process (POSMDP). The main POSMDP task is partitioned into a hierarchy of POSMDP subtasks; lower-level subtasks get solved first, th...
متن کاملRanking Efficient Decision Making Units Using Cooperative Game Theory Based on SBM Input-Oriented Model and Nucleolus Value
In evaluating the efficiency of decision making units (DMUs) by Data Envelopment Analysis (DEA) models, may be more than one DMU has an efficiency score equal to one. Since ranking of efficient DMUs is essential for decision makers, therefore, methods and models for this purpose are presented. One of ranking methods of efficient DMUs is cooperative game theory. In this study, Lee and Lozano mod...
متن کامل